CDS

Accession Number TCMCG068C06334
gbkey CDS
Protein Id KAG5574205.1
Location join(234758..234883,235100..235270,235609..235874,235956..236046,236178..236309,236955..237097,237342..237470,237997..238123,238349..238548,240967..241042,241178..241296,241395..241430,241536..241652,241862..241981,242333..242468,242734..242904,243550..243702,243800..243906,244404..244494,244602..244743,244849..245008,245167..245283,245376..245502,245910..246149)
Organism Solanum commersonii
locus_tag H5410_054339
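
The Location field above uses the feature-table `join(...)` notation: each `start..end` pair is a 1-based, inclusive exon segment, and the coding sequence is the concatenation of the segments. As a minimal sketch (the helper names `parse_join` and `spliced_length` are illustrative and not part of any database tool), the segments can be parsed and their summed length compared with the stated protein length of 1098 aa plus one stop codon:

```python
import re

# Location string copied verbatim from the CDS record above.
LOCATION = (
    "join(234758..234883,235100..235270,235609..235874,235956..236046,"
    "236178..236309,236955..237097,237342..237470,237997..238123,"
    "238349..238548,240967..241042,241178..241296,241395..241430,"
    "241536..241652,241862..241981,242333..242468,242734..242904,"
    "243550..243702,243800..243906,244404..244494,244602..244743,"
    "244849..245008,245167..245283,245376..245502,245910..246149)"
)


def parse_join(location):
    """Return the (start, end) pairs of a join(...) location, 1-based inclusive."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def spliced_length(segments):
    """Sum the segment lengths; both endpoints are included, hence the +1."""
    return sum(end - start + 1 for start, end in segments)


segments = parse_join(LOCATION)
total = spliced_length(segments)

print(len(segments))            # 24 segments
print(total)                    # 3297 nucleotides
print(total == 3 * (1098 + 1))  # True: 1098 codons plus one stop codon
```

This sketch handles only the plain forward-strand `join` form shown here; locations with `complement(...)` or partial-end markers would need extra handling.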

Protein

Length 1098 aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA655804, BioSample:SAMN15755581
db_source JACXVP010000011.1
Definition hypothetical protein H5410_054339 [Solanum commersonii]
Locus_tag H5410_054339

EGGNOG-MAPPER Annotation

COG_category S
Description Rhamnogalacturonate lyase family
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko01000
KEGG_ko ko:K18195
EC 4.2.2.23
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGGCAATTTCAGATGAAAGACAAAGGATTATGCCAACGGCCCATGATCGTGAAGTGGGCCAAAAACTTGATTATAGTGAAGCTGTTCTTCTTACAAACCCAAACAATTCTTTCATCAAAGGAGAGGTCGATGACAAATATCAATATTCTTGTGAAAACAAGGATAACCGGGTCCATGGATGGATCAGCCCGACTCCACGAACCGGGTTTTGGATGATTACACCAACCGATGAGTTCAGAACGGGTGGGCCCGTCAAACAAGACCTTACTTCTCATACCGGCCCAGTAAATCTAAATATGTTCTTTAGTACACATTATGCTGGAGAAGTATTGGGACTAAAATTTAGAAATGGAGAGCCCTGGAAAAAAGTTTTTGGCCCTGTTTTTGTCTACATGAACTCACTTTCACCTGATGAGCCAGACACTCTCACACTTTGGACCGATGCAAAAGAACAAATGTTCGTTGAGACTGAAAATTGGCCATATAATTTCCCTCTTTCGGAGGATTATGCTCGAGCTGATCAACGTGGTATTGTAAGTGGCAGATTGCTCGTTCGAGATAGATATGTTAGCCAAAGTCTTATGATTGCAAATTCAGCTTTTATTGGACTGGCTGCTCCTGGAAATGTTGGATCTTGGCAATTAGAAAATAAGGCTTATCAATTTTGGACTCAAACAGATAGTGAGGGCTATTTCTTGATCAAGAATGTTATTCCAGGGAACTATAGTTTATATGCTTGGGTGCCTGGATTTGTTGGAGATTACATGTACGATCCGTACATTATTAAACTCTTTATATATGATGCTCCAAGAAATGGTCCAACATTGTGGGAAATTGGAATTCCTGATAGAACAGCTGCTGAATTCTTCATCCCTGATGCACAACCAAAACTCTTAAATCAATTATATGTTGTACATAATCAAGAAAGGTATAGGCAATATGGATTATGGGATAGGTATACAGAGCTATACCCTGATGATGATTTGGTGTTTACTATTGGATTCAGTAACTATCAAACAGATTGGTTCTTTGCTCATCTCAATAGATATTTTTACAATGATGATGGAAACAAGACTTATGCACCAACAACATGGCAAGTTCTATTTGATCTTGAAGATGTTGATCAATCATCAAACTATACTCTTCAACTAGCATTAGCCTCAGCACATGAAGCTGAATTGCAAGTTCGATTCAATGATCCAGAAATCGATGCTCCACACTATTCGACAGGGTTGATAGGGAAGGATAACGCGATAGCAAGACATGGTATACATGGAATATACAGACTATATACTATAAATGTTCCTGGCTCTCTCCTTGGTTTTGGAACAAATGTTATGTATATCAAGCAAAGTAGAGGTGATATGCCTTTTAGAGGACTTATGAATACTTCAGCCAAAGAGAATCCCAAATTGCAACAAGTTTCTCCTCCCGTCACATTGACGGTCACAAGTCAATACGTGGTAATTGATAATGGCATTGTTCAACTTTCTTTAACTAATCCTACTGGCTCTATGTTTGGAGTTAAATATAATGGTATTGATAATCTCCTAGAACCTTTACAAGAAACACAAAGAGGATATTGGGATACTATGTGGAATGGTAAATTTGACACGCTTTTAGCATCAAAATTTAGTGTGATTGCACAAGATGATAACAAAGTTGAAGTTTCTTTCCAAAAAATACATGATCCTTTGAATGGTAACAATCCTCCTCTAAATGTTGACAAAAGGTATGTGATGCTACGTGGGAGTTCTGGATTTTATTCATATGGAATATTTCAACATTTAAAAGGATGGTCAGCTGTAAATTTGGATGAAGCTAGAATTGCTATTAAGCTCAGTAAATCACTGTTTCATTTTATGGCAATATCAGACGATAGACAAAGGATAATGCCAACAGAAGAGGATCGTGCAAGTGGTCAAACACTAGATTATAGAGAAGCTGTTAAAATTACAAATCCATCCAACCCTAGACTCAAAGATGAGGTTGATGACAA
GTACCAATATACAGATGAGATTAAAAATATTAAAGTTCATGGGTGGATAAGTGATACTCCACACATGGGGTTTTGGGTAATTTCACCAAGTTATGAATATTGTAATGGTGGACCTATGAAGCAAGATCTTACCTCTCATGTTGGTCCAACATCTATGGCTATATTTTTCAGTGGGCATTATGCAGGGCCACAATTAGGAGTTTCATTAACAAATGGAGAAGCATGGACTAAAGTTTTTGGGCCTGTATTTTTCTATGTTAATTCAGATTCTTCCAATGATCATACCATACTTTGGGAAGATGCTAAAAGACAGATGAATGAAGAAACAAATAAATGGCCTTATGATTTTCCTGCATCAATAGATTATCTTCATGCAAATCAACGTGGCTCAGTTAGTGGTCAATTAATGGTTCATGACTGGTACATAAACGTAGATCCTTTGCCTGCATTAAATGCATATATTGGACTTGCTGAACCTGGACTTGTTGGATCTTGGCAAAGTGAAACCAAGGGTTATCAATTTTGGACTCAAATTGATGATTCTGGTAATTTCAAGATAAATAATGTTAGACCTGGAATTTATGGAGTATATTCTTGGGTCCCTGGAGTTATGGGAGATTACAAATTCTCTTCTTATATAACCGTTACGCAAGGAAAGGACACATACATAGGTCAAATTATATTTGAAGCTCCAAGAAATGGTCCACCACTATGGGAAATTGGATTTCCAGATAGAACTGCTAATGAATTTTTTATACCTGATCCATTGCCTGGTCTTCAAAATTATTTGTATACTAACACTACCATACATAAGTTTAGACAATATGGTTTATGGGATCGTTACACTGATTTATATCCTAATGGAGATTTAGTGTACAAAATTGGTGTTAGTGATTTTAGAAAAGATTGGTTCTTTGCCCATGTAAATAGGAGAAATAAGGATAGGAGTTTTTCAGCAACAACATGGCAAATTGTATTTGATGTTAAAAATGTGGATTCTAGTGGCACTTATCATCTTCATATAGCTTTGGCTTCTGCATCTTATGCTCATTTACTAGTGTGGATAAATACTCCATCAAAACCAAGACCATGGTTTGATAGTTCACCAATTGGGATGAGTAATGCAATAGCAAGACATGGAATTCATGGATTATATATGACTTTCGATATTGAATTTCCAGGAACACAACTTTATATTGGTGAAAATATAATATATTTGAAGCAAGCATCAGTTTATGGCCCTTTTACTGGACTCATGTATGATTATATTCGTCTCGAAGGACCTTCAACAAAATAG
Protein:  
MAISDERQRIMPTAHDREVGQKLDYSEAVLLTNPNNSFIKGEVDDKYQYSCENKDNRVHGWISPTPRTGFWMITPTDEFRTGGPVKQDLTSHTGPVNLNMFFSTHYAGEVLGLKFRNGEPWKKVFGPVFVYMNSLSPDEPDTLTLWTDAKEQMFVETENWPYNFPLSEDYARADQRGIVSGRLLVRDRYVSQSLMIANSAFIGLAAPGNVGSWQLENKAYQFWTQTDSEGYFLIKNVIPGNYSLYAWVPGFVGDYMYDPYIIKLFIYDAPRNGPTLWEIGIPDRTAAEFFIPDAQPKLLNQLYVVHNQERYRQYGLWDRYTELYPDDDLVFTIGFSNYQTDWFFAHLNRYFYNDDGNKTYAPTTWQVLFDLEDVDQSSNYTLQLALASAHEAELQVRFNDPEIDAPHYSTGLIGKDNAIARHGIHGIYRLYTINVPGSLLGFGTNVMYIKQSRGDMPFRGLMNTSAKENPKLQQVSPPVTLTVTSQYVVIDNGIVQLSLTNPTGSMFGVKYNGIDNLLEPLQETQRGYWDTMWNGKFDTLLASKFSVIAQDDNKVEVSFQKIHDPLNGNNPPLNVDKRYVMLRGSSGFYSYGIFQHLKGWSAVNLDEARIAIKLSKSLFHFMAISDDRQRIMPTEEDRASGQTLDYREAVKITNPSNPRLKDEVDDKYQYTDEIKNIKVHGWISDTPHMGFWVISPSYEYCNGGPMKQDLTSHVGPTSMAIFFSGHYAGPQLGVSLTNGEAWTKVFGPVFFYVNSDSSNDHTILWEDAKRQMNEETNKWPYDFPASIDYLHANQRGSVSGQLMVHDWYINVDPLPALNAYIGLAEPGLVGSWQSETKGYQFWTQIDDSGNFKINNVRPGIYGVYSWVPGVMGDYKFSSYITVTQGKDTYIGQIIFEAPRNGPPLWEIGFPDRTANEFFIPDPLPGLQNYLYTNTTIHKFRQYGLWDRYTDLYPNGDLVYKIGVSDFRKDWFFAHVNRRNKDRSFSATTWQIVFDVKNVDSSGTYHLHIALASASYAHLLVWINTPSKPRPWFDSSPIGMSNAIARHGIHGLYMTFDIEFPGTQLYIGENIIYLKQASVYGPFTGLMYDYIRLEGPSTK
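
The relationship between the CDS and the Protein sequence can be checked by translating codons. The following is a minimal sketch that assumes the standard genetic code (the record does not state a translation table); `translate` is an illustrative helper, and only the first six and last three codons of the CDS above are used to keep the example short:

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[pos:pos + 3].upper()]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


first_codons = "ATGGCAATTTCAGATGAA"  # first 18 nt of the CDS above
last_codons = "ACAAAATAG"            # last 9 nt of the CDS above

print(translate(first_codons))  # MAISDE, matching the start of the protein
print(translate(last_codons))   # TK, matching the end of the protein
```

To translate the full record, the CDS lines would first need to be joined into one string, since the sequence above wraps across lines.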